TURBO DETECTION OF SPACE-TIME CODES



RELATED APPLICATIONS 

This document claims priority to and incorporates by reference 
copending provisional USSN 60/152,982 entitled "Turbo Space-Time 
Processing to Improve Wireless Channel Capacity" filed on September 9, 
1999. 

FIELD OF INVENTION 

The present invention relates to systems and processes for radio 
communications using multiple-element antenna array technology. 

BACKGROUND 
Turbo processing and space-time equalization are terms that 
comprehend several conventional ways to increase wireless channel capacity. 
Generally, turbo coding and/or processing refers to techniques aimed at 
approaching the Shannon limit in a channel, while space-time processing 
refers to techniques for processing signals from multi-element antenna arrays 
to exploit the multi-path nature of fading wireless environments. 

European patent application no. EP 817 401 A2 published July 1, 1998 
in the name of Foschini discloses the use of a number of processing layers for 
space time processing of signals from multiple-receiver antenna elements. 
There, the transmitter feeds a number of transmitter antenna elements by 
cyclically apportioning segments of the modulated encoded stream of data to 
transmitter antenna elements. At the receiver, a number of receiver antenna 
elements are coupled to a number of processing layers, in order to perform 
the space-time processing. Signal components received during respective 
periods of time over a plurality of the receive antenna elements are formed 
into respective space and time relationships in which space is associated with 
respective transmitter antenna elements. Preprocessing occurs so that a 
collection of signal components having the same space-time relationship 
forms a signal vector such that particular decoded signal contributions can be 
subtracted from the signal vector while particular undecoded contributions can 
be nulled out of the signal vector. The resulting vector is then supplied to a 
decoder for decoding to reform the data stream. Such conventional systems 
and techniques are further described in documents referred to in the "Detailed 
Description" section of this document. 

SUMMARY OF THE INVENTION 

Systems and processes according to the present invention employ a 
number of transmitter antenna elements and a number of receiver antenna 
elements coupled to multiple space-time processing layers in the receiver. In 
the present invention, however, portions of the information stream being 
communicated can be interleaved among transmitter antenna elements such 
as on a random or pseudo random basis; among other things, such 
interleaving decreases decision errors in the space-time equalization process. 
Furthermore, each processing layer preferably includes turbo processing in 
order to feed soft decisions about information being processed back to the 
equalizers. Moreover, space-time equalization processes according to the 
present invention preferably seek to maximize signal to noise ratio rather than 
zero forcing, as well as reduce multi-path effects and intersymbol interference. 
A preferred process uses minimum mean square error processing which 
allows the Shannon limit actually to be achieved. Furthermore, systems and 
processes according to the present invention preferably allow selection of the 
number and identity of receiver antenna elements to which the receiver may 
be coupled in order to optimize performance. 

According to one embodiment of the invention, an information source is 
coupled to provide a plurality of data streams to a plurality of transmit 
antennas, via, for each stream, an encoder, interleaver and symbol mapper. 
On the receiver side, a plurality of M receiver elements are coupled to a 
plurality of processing layers. The number of receiver antenna elements M is 
preferably greater than or equal to the number N of transmit antenna 
elements, since equalization according to the present invention does not 
require an extra degree of freedom. The M receiver antenna elements are 
coupled to the first processing layer by coupling to a space-time equalizer 
which preferably applies minimum mean square error processing in order to 
maximize signal to noise ratio. The output of the equalizer is applied to a 
deinterleaver, after which the deinterleaved stream is supplied to a decoder in 
the layer. Output of the decoder is provided for output common with the 
output from the other decoders in the other layers. Preferably, each layer also 
includes an interleaver which receives output from the decoder and 
deinterleaver and supplies its interleaved output back to the equalizer in the 
layer in order to provide soft decision making to the equalizer. In successive 
processing layers, output from the decoder of the preceding layer is combined 
with information from the interference canceller of the layer preceding the 
preceding layer (except the second layer, which receives signals from an 
interference canceller which is coupled to the decoder of the first layer and to 
the receive antenna elements). 

According to an alternate embodiment, the deinterleaver, interleaver 
and decoder are shared among layers, so that the equalizer of each layer 
outputs to a deinterleaver common to all layers. The output of the 
deinterleaver may then be coupled to a decoder which again is common to all 
layers. An interleaver may be provided which receives output from the 
deinterleaver and the decoder and applies it to each equalizer for soft 
decisions to be applied to the equalizers. 

Accordingly, components for deinterleaving, decoding and 
reinterleaving may be functionally located in each layer, or common to the 
layers. In the first case, each layer below the first layer processes signals 
from an interference canceller which receives signals from a decoder in the 
preceding layer and from the antenna elements (in the case of the second 
layer) or the interference canceller in the next-preceding layer (in the case of 
other layers). In the second case, each layer below the first processes 
signals from an interference canceller which receives signals from the 
equalizer in the preceding layer and from the antenna elements (in the case of 
the second layer) or the interference canceller in the next-preceding layer (in 
the case of other layers). Such turbo processing architectures can be used in 
connection with layered space-time equalization which relies on zero forcing 
rather than minimum mean square error processing. They can also be used in 
multi-array systems in which the data streams are periodically cycled rather 
than interleaved. 

It is accordingly an object of the present invention to provide improved 
layer space-time processing for communication systems which employ turbo 
processing techniques in order, among other things, to reduce decision errors. 

It is an additional object of the present invention to provide layered 
space-time processing for communication systems which seeks to maximize 
signal to noise ratio, thereby better addressing the Shannon limit, and which 
also addresses multi-path effects and/or intersymbol interference. 

It is an additional object of the present invention to provide processing 
for communication systems in which data streams may be interleaved rather 
than periodically cycled among transmit antenna elements, in order, among 
other things, to reduce decision errors. 

It is an additional object of the present invention to provide layered 
space-time processing for communication systems in which a receiver can 
select a set of antenna elements, including their number and/or identity, from 
among a larger group of antenna elements in order to optimize performance 
of the system. 

Other objects, features, and advantages of the present invention will 
become apparent with respect to the remainder of this document. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1(a) is a schematic diagram showing a first embodiment of 
communications systems according to the present invention. 

Fig. 1(b) is a schematic diagram showing a second embodiment of 
communications systems according to the present invention. 

Fig. 2 is a schematic diagram showing one form of space-time 
processing according to the present invention. 

Figs. 3(a) and 3(b) are diagrams which compare performance between 
two coding schemes according to the present invention. 

Fig. 4 is a diagram which shows different capacity bounds for 
processing according to the present invention over a flat Rayleigh fading 
channel. 

Fig. 5 is a diagram which shows simulation results for a system 
according to the first embodiment of the present invention with two transmit 
and two receive antenna elements. 

Fig. 6 is a diagram which shows simulation results for a system 
according to the first embodiment of the present invention with two transmit 
and four receive antenna elements. 

Fig. 7 is a diagram which shows simulation results for a system 
according to the first embodiment of the present invention with two, four and 
eight receive antenna elements, using soft decisions and six turbo iterations. 

Fig. 8 is a diagram which shows simulation results for a system 
according to the second embodiment of the present invention with two, four 
and eight receive antenna elements, using soft decisions and two turbo 
iterations. 

Figs. 9(a) and 9(b) are diagrams which show simulated performance of 
the first embodiment of the present invention using soft decisions and four 
antenna elements for typical urban and hilly terrain profiles. 

DETAILED DESCRIPTION 

The documents and references cited in the following disclosure are 
incorporated herein by this reference. 

Abstract: By deriving a generalized Shannon capacity formula 
for multiple-input, multiple-output Rayleigh fading channels, and 
by suggesting a layered space-time architecture concept that attains 
a tight lower bound on the capacity achievable, Foschini has 
shown a potential enormous increase in the information capacity 
of a wireless system employing multiple-element antenna arrays 
at both the transmitter and receiver. The layered space-time 
architecture allows signal processing complexity to grow linearly, 
rather than exponentially, with the promised capacity increase. 
This paper includes two important contributions: First, we show 
that Foschini's lower bound is, in fact, the Shannon bound when the 
output signal-to-noise ratio (SNR) of the space-time processing in 
each layer is represented by the corresponding "matched filter" 
bound. This proves the optimality of the layered space-time 
concept. Second, we present an embodiment of this concept 
for a coded system operating at a low average SNR and in the 
presence of possible intersymbol interference. This embodiment 
utilizes the already advanced space-time filtering, coding and 
turbo processing techniques to provide yet a practical solution 
to the processing needed. Performance results are provided for 
quasi-static Rayleigh fading channels with no channel estimation 
errors. We see for the first time that the Shannon capacity for 
wireless communications can be both increased by N times (where 
N is the number of the antenna elements at the transmitter 
and receiver) and achieved within about 3 dB in average SNR, 
about 2 dB of which is a loss due to the practical coding scheme 
we assume; the layered space-time processing itself is nearly 
information-lossless! 

Index Terms: Equalization, interference suppression, space-time 
processing, turbo processing. 



I. Introduction 

"Turbo" and "space-time" are two of the most explored 
concepts in modern-day communication theory and 
wireless research. From a communication theorist's viewpoint, 
"turbo" coding/processing is a way to approach the Shannon 
limit on channel capacity, while "space-time" processing is 
a way to increase the possible capacity by exploiting the rich 
multipath nature of fading wireless environments. We will see 
through a specific embodiment in this paper that combining the 
two concepts provides even a practical way to both increase 
and approach the possible wireless channel capacity. 

With growing bit rate demand in wireless communications, 
it is especially important to use the spectral resource efficiently. 

Paper approved by K. B. Letaief, the Editor for Wireless Systems of the IEEE 
Communications Society. Manuscript received September 15, 1999; revised 
December 3, 1999. This paper was presented at the IEEE International Conference 
on Communications, New Orleans, LA, June 2000. 

The author is with Home Wireless Networks, Norcross, GA 30071 USA 
(e-mail: fek@homewireless.com). 

Publisher Item Identifier S 0090-6778(00)07111-7. 



The basic information theory results reported by Foschini and 
Gans [1] have promised extremely high spectral efficiencies 
possible through multiple-element antenna array technology. 
In high scattering wireless environments (e.g., troposcatter, 
cellular, and indoor radio), the use of multiple spatially separated 
and/or differently polarized antennas at the receiver has 
been very effective in providing diversity against fading [2], 
[3]. Receiver diversity techniques also create signal processing 
opportunities for interference suppression and equalization 
(e.g., [4]-[6]). However, using multiple antennas at either the 
transmitter or the receiver does not enable a significant gain in 
the possible channel capacity. According to [1], the Shannon 
capacity for a system with 1 transmit and N receive antennas 
scales only logarithmically with N, as N → ∞. For a system 
using N transmit and 1 receive antennas, asymptotically 
there is no additional capacity to be gained, assuming that the 
transmit power is divided equally among the N antennas. 

Foschini and Gans [1] have shown that the asymptotic 
capacity of multiple-input, multiple-output (MIMO) Rayleigh 
fading channels grows, instead, linearly with N when N 
antennas are used at both the transmitter and the receiver. 
Furthermore, in [7], Foschini suggested a layered space-time 
architecture concept that can attain a tight lower bound on the 
capacity achievable. In this layered space-time architecture, 
N information bit streams are transmitted simultaneously 
(in the same frequency band) using N diversity antennas. 
The receiver uses another N diversity antennas to decouple 
and detect the N transmitted signals, one signal at a time. 
The decoupling process in each of the N processing "layers" 
involves a combination of nulling out the interference from 
yet undetected signals (N diversity antennas can null up to 
N - 1 interferers, regardless of the angles-of-arrival [5]) and 
canceling out the interference from already detected signals. 
One very significant aspect of this architecture is that it 
allows an N-dimensional signal processing problem, which 
would otherwise be solvable only through multiuser detection 
methods [8] with $m^N$ complexity (m is the signal constellation 
size), to be solved with only N similar 1-D processing steps. 
Namely, the processing complexity grows only linearly with 
the promised capacity. 
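
To give a sense of scale for this complexity reduction, consider an 
illustrative case. The values below are assumptions chosen for the example 
(8-PSK, so m = 8, with N = 4 and N = 8 antennas); they are not results 
taken from the text above. 

$$ m^N = 8^4 = 4096 \quad \text{versus} \quad N = 4 \ \text{one-dimensional steps} $$

$$ m^N = 8^8 = 16\,777\,216 \quad \text{versus} \quad N = 8 \ \text{one-dimensional steps} $$

Doubling N squares the joint-detection term, while the layered count 
merely doubles. 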

This paper includes two important contributions. First, we 
show that Foschini's lower bound is, in fact, the Shannon 
bound when the output SNR of the space-time processing in 
each layer is represented by the corresponding "matched filter" 
bound [6], i.e., the maximum SNR achievable in a hypothetical 
situation where the array processing weights to suppress the 
remaining interference in each layer are chosen to maximize the 
output signal-to-interference-plus-noise ratio and any possible 
intersymbol interference (ISI) is assumed to be completely 
eliminated by some means of equalization. The "matched filter" 
bound has been shown to be approachable using minimum 
mean-square error (MMSE) space-time filtering techniques 
[6].¹ By showing the equivalence of the generalized Foschini's 
bound and the Shannon bound, we essentially prove the 
optimality of the layered space-time concept. 

Second, we present an embodiment of Foschini's layered 
space-time concept for a coded system operating at a low average 
SNR and in the presence of unavoidable ISI. Previously, 
a different embodiment has been provided in [9] for an uncoded 
system with variable signal constellation sizes, operating at 
a high average SNR without ISI. Adding coding redundancy 
might, at first, seem conflicting with the desire to increase the 
channel bit rate. Our justification is as follows: First, we seek 
to enhance the channel capacity from a system perspective. 
We use "noise" in SNR to represent all system impairments, 
including thermal noise and multiuser interference. The ability 
to operate at low SNR's means that more users per unit area 
can occupy the same bandwidth simultaneously. Second, we 
anticipate the use of adaptive-rate coding schemes to permit 
different degrees of error protection according to the channel 
SNR's. Incremental redundancy transmission [10], currently 
being considered for the Enhanced Data Services for GSM 
Evolution (EDGE; GSM stands for Global System for Mobile 
Communications) standard, is an efficient way to implement 
adaptive code rates without requiring channel SNR monitoring. 
With such adaptive-rate coding, the system does not "waste" 
spectral resources under good channel conditions. 

Meanwhile, the iterative processing principle used in turbo 
and serial concatenated coding [11]-[15] has been successfully 
applied to a wide variety of joint detection and decoding problems. 
One such application is the so-called "turbo equalization" 
[16]-[19], where successive maximum a posteriori (MAP) 
processing is performed by the equalizer and channel decoder 
to provide a priori information about the transmit sequence 
to one another. Similar to the layered space-time concept, 
turbo processing allows a multi-dimensional (two-dimensional 
in this case) problem to be optimally solved with successive 
1-D processing steps without much performance penalty. In 
this paper, we apply the turbo principle to layered space-time 
processing in order to prevent decision errors produced in each 
layer from catastrophically affecting the signal detection in 
subsequent layers. 

We consider two possible coded layered space-time structures: 
one applying coding across the multiple signal processing 
layers, and the other assuming independent coding within each 
layer. Similar to [1], we assume a quasi-static random Rayleigh 
channel model, where the channel characteristics are stationary 
within each data block, but statistically independent between 
different data blocks, different antennas, and, in the case of 
dispersive multipath channels, different paths. The system is 
assumed to have similar ISI situations as in EDGE and GSM, 
where multipath dispersions may last up to several symbol 
periods [20]. We show that near-capacity performance is achievable 
using 1-D processing and coding techniques that are already 
practical and "legacy-compatible" with the EDGE standard, 
e.g., the use of bit-interleaved 8-ary phase-shift keying 
(8-PSK) with rate-1/3 convolutional coding and an equalizer 
with a similar length and structure. 

¹ In a flat fading case, MMSE array processing achieves exactly the "matched 
filter" bound performance. 

A slightly different layered space-time approach based on 
space-time coding [23], [24] has been studied in [25]. Although 
it is difficult to make a general comparison, we will see later that 
our coded layered space-time approach does by far outperform 
the results reported in [25] for N = 4 and N = 8. On the 
other hand, for N = 2, space-time coded quaternary phase-shift 
keying (QPSK) without layered processing appears to be the 
best known technique for achieving a spectral efficiency of 2 
bps/Hz. 

This paper is organized as follows. Section II provides a 
brief review of Foschini's layered space-time concept. Section 
III describes the two coded layered space-time architectures 
and presents a capacity analysis which reveals the equivalence 
of a generalized Foschini's lower bound formula and the true 
capacity bound. Section IV provides details on the array 
processing, equalization, and iterative MAP techniques. Section V 
presents performance results. A summary and conclusions are 
given in Section VI. 

II. Background Theory 

We briefly review the theory behind Foschini's layered 
space-time concept. The generalized Shannon capacity for a 
MIMO Rayleigh fading system with N transmit and M receive 
antennas is given in [1] as 

$$ C = \log_2\left[\det\left(I + \frac{\rho}{N} H H^{\dagger}\right)\right] \qquad (1) $$

where $H$ is an M × N matrix, the (i, j)th element of which 
is the normalized channel transfer function of the transmission 
link between the jth transmit antenna and the ith receive 
antenna, $I$ is the M × M identity matrix, $\rho$ is the average SNR 
per receive antenna, and det(·) and superscript † denote 
determinant and conjugate transpose. It is assumed that the transmit 
power is equally divided among the N transmit antennas. The 
normalization of the channel transfer function is done such that 
the average (over Rayleigh fading) of its squared magnitude is 
equal to unity. 
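
As a small illustration of (1), the following sketch evaluates the formula 
for one fixed channel matrix. It is only a sketch under stated assumptions: 
the function names (`hermitian_gram`, `det`, `mimo_capacity`) are hypothetical, 
the matrix is passed as a list of rows, and `rho` is a linear (not dB) SNR. 

```python
import math

def hermitian_gram(H):
    """Return H H^dagger for an M x N matrix H given as a list of rows."""
    M = len(H)
    return [[sum(a * b.conjugate() for a, b in zip(H[i], H[k]))
             for k in range(M)] for i in range(M)]

def det(A):
    """Determinant by Gaussian elimination with partial pivoting."""
    A = [row[:] for row in A]
    n = len(A)
    d = 1 + 0j
    for c in range(n):
        # Pick the largest pivot in column c to limit round-off.
        p = max(range(c, n), key=lambda r: abs(A[r][c]))
        if abs(A[p][c]) == 0:
            return 0j
        if p != c:
            A[c], A[p] = A[p], A[c]
            d = -d
        d *= A[c][c]
        for r in range(c + 1, n):
            f = A[r][c] / A[c][c]
            for k in range(c, n):
                A[r][k] -= f * A[c][k]
    return d

def mimo_capacity(H, rho):
    """Equation (1): log2 det(I + (rho/N) H H^dagger), in bps/Hz."""
    M, N = len(H), len(H[0])
    G = hermitian_gram(H)
    A = [[(1 if i == k else 0) + (rho / N) * G[i][k] for k in range(M)]
         for i in range(M)]
    # I + (rho/N) H H^dagger is Hermitian positive definite, so det is real.
    return math.log2(det(A).real)
```

For example, `mimo_capacity([[1, 0], [0, 1]], 2.0)` evaluates the case of two 
parallel unit-gain links with half of the power each, where the determinant 
is 2 × 2 = 4. 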
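The lower bound on capacity is provided in [1] as 

$$ C > \sum_{k=N-M+1}^{N} \log_2\left[1 + \frac{\rho}{N}\,\chi^2_{2k}\right] = C_F \qquad (2) $$

where $\chi^2_{2k}$ is a chi-squared random variable with 2k degrees of 
freedom. For M = N, 

$$ C_F = \sum_{k=1}^{N} \log_2\left[1 + \frac{\rho}{N}\,\chi^2_{2k}\right] \qquad (3) $$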

Since $\chi^2_{2k}$ represents a fading channel with a diversity order 
of k, the lower-bound capacity in (3) can be viewed as the sum 
of the capacities of N independent channels with increasing 
diversity orders from 1 to N. This suggests a layered space-time 
approach [7] for detecting the N transmitted signals as follows: 
In the first layer, the receiver detects a first transmitted signal 
by nulling out interference from N - 1 other transmitted signals 
through array processing. Assuming a "zero forcing" (ZF) 
constraint, one receive antenna is needed to completely correlate 
and subtract each interference [5]. Thus, the overall process of 
nulling N - 1 interferences leaves the receiver with N - (N - 
1) = 1 degree of freedom to provide diversity for detecting the 
first signal, i.e., a diversity order of 1 (or simply no diversity). 
Once detected, the first signal is subtracted out from the received 
signals on all N antennas. 

In the second layer, the receiver performs similar interference 
nulling to detect a second transmitted signal. This time, since 
there are only N - 2 remaining interferences, the receiver 
affords a diversity order of 2. The detected signal is again 
subtracted out from the received signals provided by the first layer. 

Repeating the above interference nulling/canceling step 
through N layers, we see that the receiver affords an increasing 
order of diversity from 1 to N. If the capacities achieved in 
individual layers can be combined in some manner, then the 
layered space-time approach just mentioned will achieve the 
capacity lower bound expressed in (3). We will explore two 
capacity combining possibilities in the next section. 
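
The layer-by-layer view of (3) can be sketched numerically by drawing the 
chi-squared terms at random and averaging. This is a minimal Monte Carlo 
sketch, not the procedure used for the results in this paper. It assumes 
that $\chi^2_{2k}$ is normalized as the sum of k unit-mean exponential 
variates (mean k), which corresponds to unit-variance complex Gaussian 
channel gains; the function names, trial count and seed are hypothetical. 

```python
import math
import random

def foschini_bound_sample(N, rho, rng):
    """One random draw of the lower bound (3) for M = N, in bps/Hz."""
    total = 0.0
    for k in range(1, N + 1):
        # Layer k sees diversity order k. Assumed normalization:
        # chi-squared with 2k degrees of freedom drawn as Gamma(k, 1).
        chi2_2k = rng.gammavariate(k, 1.0)
        total += math.log2(1.0 + (rho / N) * chi2_2k)
    return total

def mean_foschini_bound(N, rho_db, trials=2000, seed=1):
    """Average of the bound over independent fading draws."""
    rng = random.Random(seed)
    rho = 10.0 ** (rho_db / 10.0)  # convert dB to a linear ratio
    draws = (foschini_bound_sample(N, rho, rng) for _ in range(trials))
    return sum(draws) / trials
```

Calling `mean_foschini_bound(N, 10)` for N = 1, 2, 4 shows the averaged 
bound growing with N even though the total transmit power is held fixed, 
which is the behavior the layered argument above predicts. 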

Note that the capacity and capacity lower bound given in 
(1)-(3) are actually frequency-dependent. We here provide an 
explicit capacity formula for band-limited, frequency-selective 
channels (some variables are redefined to be consistent with 
later analytical development). 

C = (log2(det(«H-Ml} (4) 

where, as shown in equations (5)-(8) at the bottom of the page, 
$\mathcal{R}$ is the frequency-domain correlation matrix of the signals 
on M receive antennas, $N_j(f)$ is the noise power density at 
frequency f on the jth receive antenna, T is the symbol period, 
$H_{ij}(f)$ is the channel transfer function (not normalized) of the 
transmission link between the ith transmit antenna and the 
jth receive antenna, and superscripts * and T denote complex 
conjugate and transpose. Note in (7) and (8) that we consider 
the folded spectra $H_{ij}(f - m/T)$ and $N_j(f - m/T)$ of 
the channel transfer function and noise power density, where 
m = -J, ..., J (J is finite because the signal sources are 
assumed to be band-limited). This is to take into account the 
effect of excess bandwidth and symbol-rate sampling when 
the frequency selectivity of the channel is not symmetrical 
around the Nyquist band edges. Even though we assume white 
Gaussian noise, the noise power density near and outside the 
Nyquist band edges actually attenuates with the receive filter 
transfer function. From our experiment (assuming a square-root 
Nyquist filter with a 50% rolloff factor), the computed capacity 
can be underestimated by as much as 0 J dB if this attenuation 
is not taken into account. 

III. Coded Layered Space-Time Architectures 

A. Basic Concepts 

We consider two coded layered space-time approaches as 
shown in Fig. 1(a) and (b). In the first approach, named "LST-I" 
(LST stands for "layered space-time"), the coded information 
bits are interleaved across the N parallel data streams 
$x_1, x_2, \ldots, x_N$, where $x_i$ denotes a sequence of complex-valued 
transmit data symbols (e.g., 8-PSK symbols). The receiver first 
decouples the N data streams through interference 
nulling/cancellation, as described in Section II, then deinterleaves and 
decodes all the N data streams as one information block. In 
the second approach, "LST-II," the information is first divided 
into N uncoded bit sequences $u_1, u_2, \ldots, u_N$, each of which 
is independently encoded, interleaved, and symbol-mapped to 
generate one of the N parallel data streams. At the receiver, the 
N data streams are decoupled and independently deinterleaved 
and decoded. The output of LST-II produces N information 
blocks at a rate of 1/N times the output rate of LST-I. 

Fig. 1. Coded layered space-time architecture: (a) LST-I and (b) LST-II. 
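
The difference between the two transmit-side arrangements can be sketched 
as follows. This is an illustrative sketch only: the pseudo-random 
interleaver, the function names and the `repeat3` stand-in for the rate-1/3 
convolutional encoder are all assumptions made for the example, symbol 
mapping is omitted, and the block length is assumed to be divisible by the 
number of streams. 

```python
import random

def lst1_streams(coded_bits, n_streams, seed=0):
    """LST-I: interleave one coded block across all N parallel streams."""
    order = list(range(len(coded_bits)))
    random.Random(seed).shuffle(order)  # assumed pseudo-random interleaver
    interleaved = [coded_bits[i] for i in order]
    # Deal the interleaved bits out to the streams in round-robin order.
    return [interleaved[k::n_streams] for k in range(n_streams)]

def lst2_streams(info_bits, n_streams, encode, seed=0):
    """LST-II: split into N uncoded sequences, then encode and
    interleave each sequence on its own."""
    size = len(info_bits) // n_streams
    streams = []
    for k in range(n_streams):
        coded = encode(info_bits[k * size:(k + 1) * size])
        order = list(range(len(coded)))
        random.Random(seed + k).shuffle(order)
        streams.append([coded[i] for i in order])
    return streams

def repeat3(bits):
    # Hypothetical stand-in for the rate-1/3 convolutional encoder.
    return [b for b in bits for _ in range(3)]
```

In `lst1_streams` every stream carries bits from the whole coded block, so 
the decoder must wait for all layers; in `lst2_streams` each stream is a 
self-contained codeword that can be decoded as soon as its layer is 
detected. 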

In Fig. 1(a) and (b), "space-time equalizer" refers to a combined 
array processing (for interference nulling) and equalization 
function. Instead of the ZF criterion, we assume that the 
optimization of the antenna/equalizer weights is based on a MMSE 
criterion, which in general provides better performance than a 
ZF approach. Foschini [7] has also indicated a potential 
performance benefit of using MMSE (or "maximum SNR") rather 
than ZF in a layered space-time architecture. Although we show 
M receive antennas in Fig. 1(a) and (b) (M ≥ N is the sufficient 
condition for nulling N - 1 interferences), we only consider 
M = N in this study. 

Similar to [9], the underlying assumption of our layered 
space-time architecture is that the receiver can order the 
detections of N data streams such that an undetected layer always 
has the strongest received SNR. In LST-I, the space-time 
equalizer in each layer must provide data decisions $\hat{x}_{\lambda(i)}$ 
($\lambda$ denotes the permutation due to layer ordering) to the interference 
canceller, since decoding cannot be done until all the layers 
are processed. In LST-II, the interference cancellation in each 
layer can use more reliable data decisions $\hat{u}_{\lambda(i)}$ provided by 
the decoder. Thus, LST-I is more prone to decision errors than 
LST-II. In order to minimize the effects of decision errors, and 
also to improve the joint detection/decoding performance in 
general, we assume the use of turbo processing in our layered 
space-time architecture. As shown in Fig. 1(a) and (b), the 
space-time equalizers and the decoders provide extrinsic soft 
information to one another by subtracting the received soft 
information from the newly computed soft information. Details 
on MMSE space-time equalization and turbo processing will 
be provided in Section IV. 
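
The extrinsic-information exchange just described can be sketched with 
log-likelihood ratios (LLRs). This is a toy sketch under loud assumptions: 
`toy_equalizer` and `toy_decoder` are hypothetical stand-ins (a memoryless 
channel and a length-3 repetition code), not the MMSE space-time equalizer 
or MAP decoder of Section IV, and interleaving between the two modules is 
omitted. 

```python
def extrinsic(posterior, prior):
    """Extrinsic LLRs: newly computed soft values minus those received."""
    return [p - a for p, a in zip(posterior, prior)]

def toy_equalizer(channel_llr, prior):
    # Stand-in SISO equalizer for a memoryless channel:
    # posterior = channel observation + a priori input.
    return [c + a for c, a in zip(channel_llr, prior)]

def toy_decoder(prior, reps=3):
    # Stand-in SISO decoder for a repetition code: every copy of a bit
    # gets the sum of the LLRs of all copies of that bit.
    out = []
    for i in range(0, len(prior), reps):
        group = prior[i:i + reps]
        out.extend([sum(group)] * len(group))
    return out

def turbo_iterations(channel_llr, n_iter=2):
    """Alternate equalizer and decoder, passing only extrinsic LLRs."""
    to_equalizer = [0.0] * len(channel_llr)  # no a priori knowledge yet
    posterior = list(channel_llr)
    for _ in range(n_iter):
        eq_post = toy_equalizer(channel_llr, to_equalizer)
        to_decoder = extrinsic(eq_post, to_equalizer)
        posterior = toy_decoder(to_decoder)
        to_equalizer = extrinsic(posterior, to_decoder)
    return posterior
```

Because each module subtracts what it was handed before passing its output 
on, the channel observations are never counted twice: in this toy setup the 
posterior stays at the sum of the three channel LLRs no matter how many 
iterations are run. 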

B. Capacity Analysis 

Without getting into the detail of all the processing functions, 
we first discuss the general differences between the two coded 
layered space-time approaches. In particular, we are most 
interested in the capacity combining aspects of the two approaches. 

Let $\mathrm{SNR}_k$ denote the output SNR of the array processing in 
the kth layer. First, we note that, in LST-II, the capacity of each 
processing layer is bounded by the spectral efficiency R of the 
modulation and coding in each layer, e.g., R = 1 for 8-PSK 
with rate-1/3 coding. Thus, the total capacity of LST-II is given 
by (similar to (1)-(3), we write capacity without showing the 
frequency dependence) 




[Equation (9), a sum over the layers up to N, is illegible in the source text.]    (9)



Without layer ordering, it is most likely that the overall 
performance of LST-II will be largely influenced by the error 
probability of the first processing layer with a diversity order of only 
1. In contrast, our simulation results in Section V will show that 
LST-II with layer ordering can actually achieve a diversity order 
of approximately N. 

Since coding is performed across all the processing layers in 
LST-I, the achieved SNR in each layer will contribute to the 
overall layer processing performance. As Foschini [7] indicated, 
such a coding scheme should be able to achieve the capacity 
lower bound in (3). Here, we provide a generalized formula of 
Foschini's lower bound by removing the ZF constraint and 
instead using $\mathrm{SNR}_k$ as the generalized output SNR. 



$$ C_F = \sum_{k=1}^{N} \log_2\left(1 + \mathrm{SNR}_k\right) \qquad (10) $$



Reference [6] provides output SNR formulas for different 
types of optimum space-time processors. Here, it is of great 
interest to express the capacity lower bound using the best 
performance achievable. In the following equation, we represent 
$\mathrm{SNR}_k$ in (10) by the "matched filter" bound, the maximum 
achievable SNR by any space-time processing receiver 

[...] $\log_2\left(1 + \Gamma_k(f)\right)$ [...]    (11)

where $\Gamma_k(f)$ is the "matched filter" bound² given by equation 
(15) in Section IV-A (simply a rewriting of the result in [6]). 

² Note that the "matched filter" bound usually refers to the integrated SNR 
($\Gamma_k(f)$) over the signal bandwidth (e.g., [6]). However, in the capacity 
context, we assume the best possible way to exploit the SNR's in all 
frequency components. 

Fig. 2. Space-time DDFSE with MAP processing. 

Note that (11) is an explicit formula similar to (4); it shows the 
frequency dependence of the output SNR and the integration of 
capacity over the signal bandwidth. Also, we assume that the 
kth layer has k - 1 interferences. 

In the process of analyzing the meaning of (11), we discovered 
an identical relationship between (11) and (4) regardless of 
how the layers are ordered. We show the proof in the Appendix 
(this proof is valid even when M ≠ N). Thus, Foschini's lower 
bound (3) is actually the true Shannon capacity bound when 
the output SNR of the space-time processing in each layer is 
represented by the corresponding "matched filter" bound. This 
proves the optimality of the layered space-time concept. 

The capacity analysis presented above is based on the 
assumption of perfect layer detection, i.e., no decision errors 
affecting the detection in subsequent layers. In reality, LST-I is 
more prone to decision errors than LST-II and layer ordering 
becomes important for both schemes. Our simulation results in 
Section V will demonstrate how decision errors affect the actual 
performance of the two coded layered space-time approaches. 

IV. Signal Processing Functions 

A. Space-Time Equalization 

We consider combined array processing and equalization in 
order to cope with dispersive channels. A space-time equalizer, 
consisting of a spatial/temporal whitening filter, followed by 
a decision-feedback equalizer (DFE) or maximum-likelihood 
sequence estimator, can suppress both ISI and dispersive 
interference [6]. The space-time equalizer used in this study 
is shown in Fig. 2. It consists of a linear feedforward filter 
$W_j(f)$, j = 1, ..., M, on each diversity branch, a combiner, 
symbol-rate sampler, soft-input, soft-output (SISO) MAP 
sequence estimator, and synchronous linear feedback filter 
$B(f)$. The feedforward filters $\{W_j(f)\}$ are shown as 
continuous-time filters, but they can be implemented in practice 
using fractionally-spaced tapped delay lines. The combined 
use of a sequence estimator and feedback filter after diversity 
combining is similar to the structure of a delayed decision-feedback 
sequence estimator (DDFSE) [27]. Thus, we refer to the 
space-time equalizer in Fig. 2 as a "space-time DDFSE." A 
"space-time DFE" is a structure where the sequence estimator 
is replaced by a memoryless hard slicer. 

It has been shown in [20] that a space-time DDFSE with a
sequence estimator memory of $\mu$ and a feedback filter of length
$L_B - \mu$ can be optimized in an MMSE manner as if it were a
space-time DFE with a feedback filter of length $L_B$. In fact,
numerical results in [6] showed that an optimum space-time
DFE (with unconstrained filter lengths and no feedback decision
errors) can perform within only 1-2 dB of the ideal "matched
filter" bound performance. Thus, in order to have a practical
receiver structure for layered space-time processing, we consider
a space-time DDFSE with a minimum sequence estimator
memory, i.e., $\mu = 1$. The sequence estimator is used only to
provide a trellis structure needed for turbo processing, and
presumably more reliable feedback decisions than the slicer used
in a space-time DFE. Details on MAP processing will be given
in Section IV-B.

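We first provide a brief review of the space-time filtering
theory. Based on the space-time DFE equivalent model described
above, the MMSE solution for the feedforward filters
with unconstrained length can be given using the
results of [6] (see also [28])

$$\mathbf{W}(f) = \mathbf{R}_k^{-1}\mathbf{H}_k^*\,(1 + B(f)) = \mathbf{R}_{k-1}^{-1}\mathbf{H}_k^*\,\frac{1 + B(f)}{1 + \Gamma_k(f)} \qquad (12)$$

where

$$\mathbf{R}_k = \sum_{j=1}^{k} \mathbf{H}_j^*\mathbf{H}_j^T + \mathbf{K} \qquad (13)$$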

••• • • ^^'v/(/ + ^)]'^ (14, 

$$\Gamma_k(f) \triangleq \mathbf{H}_k^T\mathbf{R}_{k-1}^{-1}\mathbf{H}_k^*. \qquad (15)$$

In the above equations, we assume that there are a total of k
signal sources and we use $\mathbf{H}_k$ to indicate the channel vector
[see (8)] of the desired signal. The remaining k - 1 signals
are interferences. $B(f)$ in (12) is the feedback filter of the
space-time DFE, which, from our assumption of $\mu = 1$, is
only 1 tap longer than $B'(f)$. $\Gamma_k(f)$ is the signal-to-interference-plus-noise
power density ratio at frequency f, i.e.,
the "matched filter" bound. $B(f)$ can be determined through
spectral factorization of $1 + \Gamma_k(f)$.

Equation (12) indicates that the optimum feedforward filter
consists of a spatial/temporal filter $\mathbf{R}_{k-1}^{-1}\mathbf{H}_k^*$, which performs
prewhitening ($\mathbf{R}_{k-1}^{-1}$ is the whitening filter of interference and
noise) and matching to the desired channel, followed by a temporal
filter $(1 + B(f))/(1 + \Gamma_k(f))$, which is an anticausal
post-whitening filter for suppressing precursor ISI.

TABLE I
GSM Typical Urban (TU) Channel Model

Path Delay (μs)    0.0    0.2    0.5    1.6    2.3    5.0
Path Power (dB)   -3.0    0.0   -2.0   -6.0   -8.0  -10.0

TABLE II
GSM Hilly Terrain (HT) Channel Model

Path Delay (μs)    0.0    0.2    0.4    0.6   15.0   17.2
Path Power (dB)    0.0   -2.0   -4.0   -7.0   -6.0  -12.0

A filter length analysis of the optimum space-time DFE described
above is provided in [6]. We will first consider a finite-length
realization of the space-time DFE based on the results
presented there. We assume that the system has similar
ISI situations as in EDGE and GSM. Namely, using the multipath
delay profiles specified for EDGE and GSM (see Tables I
and II), and assuming the same symbol rate of 270.833
kbaud (T = 3.692 μs) with Nyquist filtering (partial response
signaling is used in EDGE and GSM), the ISI lasts up to five
symbol periods for the hilly terrain (HT) profile in Table II.
According to the empirical filter length formulas in [6], the
feedforward filter on each branch should have the following causal
and anticausal lengths to achieve near-optimum performance

$$L_C = K(N - 1)(\rho_{dB}/10)$$
$$L_A = K + KN(\rho_{dB}/10) \qquad (16)$$

where K is the channel memory, N is used here to indicate
the total number of signals, including the desired and interference
signals, and $\rho_{dB}$ is the average SNR in decibels. In our
case, K = 5, and assume for example that the system has four
transmit and four receive antennas (N = 4) and the operating
range of average SNR is around 5 dB ($\rho_{dB} = 5$). The required
filter length, including the center tap, will be $L_C + L_A + 1 \approx 23.5$.
This is a highly impractical number, considering that four
such filters are required, one per receive antenna. Furthermore,
as mentioned earlier, the optimum feedforward filters
should be implemented using fractionally-spaced tapped delay
lines. If T/2-spaced filters are used, the total number of taps
will be doubled. Such a space-time system with about 200
coefficients would be nearly impossible to compute in any radio
link design.
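
The filter-length arithmetic above can be reproduced with a short sketch. It evaluates (16) for the example values in the text (K = 5, N = 4, 5 dB average SNR) and then scales the per-branch length by four receive branches and T/2 spacing. The function and parameter names are illustrative assumptions, not part of the original study.

```python
def feedforward_lengths(channel_memory, num_signals, snr_db):
    """Empirical causal (L_C) and anticausal (L_A) lengths from (16)."""
    l_c = channel_memory * (num_signals - 1) * (snr_db / 10.0)
    l_a = channel_memory + channel_memory * num_signals * (snr_db / 10.0)
    return l_c, l_a


def total_coefficients(channel_memory, num_signals, snr_db, branches, oversampling=2):
    """Taps per branch (with center tap) and total taps over all branches."""
    l_c, l_a = feedforward_lengths(channel_memory, num_signals, snr_db)
    taps_per_branch = l_c + l_a + 1  # +1 for the center tap
    return taps_per_branch, taps_per_branch * branches * oversampling


# Example from the text: K = 5, N = 4, 5 dB, four receive branches, T/2 spacing.
taps, total = total_coefficients(5, 4, 5, branches=4)
print(taps, total)
```

With these inputs the per-branch length is 23.5 and the total is 188 taps, which is the "about 200 coefficients" quoted in the text.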

Faced with such impracticality of an ideal signal processing
arrangement, we proceed to consider a suboptimum option.
First, we will use symbol-spaced instead of fractionally-spaced
feedforward filters. In order to avoid significant performance
penalties, a channel estimation-based timing recovery algorithm
described in [29] will be used to optimize the symbol
timing and the decision delay of the center tap relative to the
measured channel impulse response. In principle, such timing
optimization also allows the DFE to use a feedforward filter
with a shorter span than the channel memory while achieving
a reasonable performance [29], [30]. After experimenting with
a number of significantly reduced filter length options, we
decided on the following suboptimum space-time equalizer
structure. The feedforward filter on each branch has a total
of nine symbol-spaced taps, which are positioned such that
$L_C = L_A = 4$. The feedback filter has a length of 8, i.e.,
$L_B = 9$ with the MAP processor memory $\mu = 1$ included
(in order to completely cancel postcursor ISI, $L_B$ must be at
least as large as the channel memory plus the number of causal
taps in the feedforward filter). The method in [29] is used to
optimize the symbol timing and the decision delay of the center
tap as described above. Direct matrix inversion is used to set
all the filter coefficients in a standard MMSE linear processing
fashion [4], [26], [31], assuming perfect channel estimation.

B. Turbo Processing 

The turbo processing technique used in this study is also
based on a standard approach; the reader is referred to the
rich literature [11]-[19], [21]-[22], [32]-[35] for a thorough
treatment of this subject. The space-time equalizer and the
decoder both perform SISO sequence estimation to compute
the a posteriori probability (APP) of the transmit data symbols.
This sequence estimation is done using the Bahl-Cocke-Jelinek-Raviv
(BCJR) forward/backward algorithm. In the
following, we describe the basic principle of the iterative
detection/decoding process.

Using the BCJR algorithm, the MAP processor in the
space-time equalizer with $m^{\mu}$ states (m is the signal
constellation size, e.g., m = 8 for 8-PSK, and $\mu = 1$ in our case)
computes the APP $P[c_k \mid y]$ of the kth coded bit $c_k$ based on
the observation y, where y is the equalizer output sequence
corresponding to all the data symbols in a received block (see
Fig. 2), and the a priori information provided by the decoder
(this is not available in the first "turbo" iteration). The logarithm
$\lambda(c_k) = \log_e(P[c_k \mid y])$ of this APP can be regarded as the sum
of two terms

$$\lambda(c_k) = \lambda^p(c_k) + \lambda^e(c_k) \qquad (17)$$

where $\lambda^p(c_k) = \log_e(P[c_k])$ is the logarithm of the a priori
information provided by the decoder, and $\lambda^e(c_k)$ is called the
"extrinsic" information. In each "turbo" iteration, the space-time
equalizer subtracts $\lambda^p(c_k)$ from the newly computed value of
$\lambda(c_k)$ to obtain the extrinsic information $\lambda^e(c_k)$ [see Fig. 1(a)
and (b)]. The entire sequence $\{\lambda^e(c_k)\}$ is deinterleaved and
forwarded to the decoder.

Similarly, the decoder computes the log-APP
$\log_e(P[c_k \mid \{\lambda^e(c_k)\}])$ based on the deinterleaved
extrinsic information provided by the space-time equalizer,
and subtracts $\lambda^e(c_k)$ from it to obtain its own extrinsic
information. This extrinsic information is then interleaved and
forwarded to the equalizer as the new a priori information
$\lambda^p(c_k)$ for the next "turbo" iteration.

The interleaver considered in this study is a pseudo-random
interleaver, i.e., we generate a pseudo-random permutation of
numbers from 1 to L, where L is the block length, and then use
this permutation as a fixed interleaver.
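
A fixed pseudo-random interleaver of this kind can be sketched as follows. This is a minimal illustration, assuming a seeded generator so that transmitter and receiver derive the same permutation; the function names and the seed are hypothetical, and indices run from 0 to L - 1 rather than 1 to L.

```python
import random


def make_interleaver(block_length, seed=0):
    """Return a fixed pseudo-random permutation of 0 .. block_length - 1."""
    perm = list(range(block_length))
    random.Random(seed).shuffle(perm)  # seeded, so the permutation is repeatable
    return perm


def interleave(sequence, perm):
    """Output position i carries the input element at position perm[i]."""
    return [sequence[p] for p in perm]


def deinterleave(sequence, perm):
    """Inverse of interleave(): put each element back at its original index."""
    restored = [None] * len(perm)
    for i, p in enumerate(perm):
        restored[p] = sequence[i]
    return restored


perm = make_interleaver(12, seed=1)
data = list("abcdefghijkl")
assert deinterleave(interleave(data, perm), perm) == data
```

The same permutation would be applied to extrinsic values on the way from decoder to equalizer, and inverted on the way back.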

In combining the branch metric obtained from the equalizer
output with the branch metric obtained from the soft input
provided by the decoder, the MAP processor in the space-time
equalizer must compute the a priori information for each 8-PSK
symbol $x_k$ from the three soft inputs ($\lambda^p(c_{3k+1})$, $\lambda^p(c_{3k+2})$,
$\lambda^p(c_{3k+3})$). We assume that this is done by way of summing
the three soft inputs as if the three coded bits were transmitted
from independent sources (these soft inputs are actually not
independent when conditioned on the observed waveform of the
entire data burst). This is a suboptimal method, which is known
to cause a "random modulation" performance degradation
effect in bit-interleaved coded modulation [36]-[38]. However,
this effect can be overcome by iterative decoding [38], which is
implicit in our turbo space-time processing approach.
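
The summation of the three soft inputs can be illustrated with a small sketch. It assumes each soft input is available as a pair of log-probabilities (for bit value 0 and bit value 1) and treats the three bits as independent, as the text describes. The representation and the names are assumptions made for illustration; the mapping from 3-bit labels to 8-PSK constellation points is not modeled here.

```python
import math
from itertools import product


def symbol_log_priors(bit_log_probs):
    """Sum per-bit log-probabilities into a log-prior for every bit label.

    bit_log_probs: one (log P[c = 0], log P[c = 1]) pair per coded bit.
    Returns a dict mapping each label, e.g. (0, 1, 1), to its log-prior.
    """
    priors = {}
    for label in product((0, 1), repeat=len(bit_log_probs)):
        priors[label] = sum(pair[bit] for pair, bit in zip(bit_log_probs, label))
    return priors


# Hypothetical decoder outputs: probability that each of the three bits is 1.
p_one = [0.9, 0.5, 0.2]
bit_log_probs = [(math.log(1.0 - p), math.log(p)) for p in p_one]
priors = symbol_log_priors(bit_log_probs)
print(len(priors))
```

Because the bits are treated as independent, the eight symbol priors multiply out from the bit probabilities and sum to one.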

As noted earlier, in LST-I, the space-time equalizer in each
layer must provide immediate data decisions to be used for
interference cancellation. Since these decisions are not "protected"
by coding, they are prone to errors. In this study, we explore
a soft decision technique to minimize the effect of decision
errors. The optimum soft decision can be computed by averaging
all the possible transmit symbols weighted by their APPs [39]

$$\hat{x}_k = \sum_{x \in X} x\, P[x_k = x \mid y] \qquad (18)$$

where X includes all the complex-valued 8-PSK constellation
points. Since $P[x_k = x \mid y]$ can be obtained along with the
computation of the APP $P[c_k \mid y]$, this soft decision approach can be
implemented with nearly no additional cost in complexity.
Similarly, we apply the same technique to compute soft decision
outputs in LST-II.
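
The soft decision in (18) is an APP-weighted average of the constellation points. The sketch below assumes a unit-energy 8-PSK constellation with points at multiples of 45 degrees and a list of symbol APPs in the same order; both the point ordering and the function names are illustrative assumptions.

```python
import cmath
import math


def psk_points(m=8):
    """Unit-energy m-PSK constellation points (assumed ordering by phase)."""
    return [cmath.exp(2j * math.pi * i / m) for i in range(m)]


def soft_decision(app, points):
    """APP-weighted average of the constellation points, as in (18)."""
    return sum(p * x for p, x in zip(app, points))


points = psk_points()

# A confident APP reproduces the hard decision.
confident = [0.0] * 8
confident[2] = 1.0
print(soft_decision(confident, points))

# A completely uncertain APP yields (almost exactly) zero, so an
# unreliable symbol contributes little to interference cancellation.
uniform = [1.0 / 8] * 8
print(abs(soft_decision(uniform, points)))
```

The magnitude of the soft decision shrinks as the APPs become less decisive, which is what limits the damage from wrong decisions.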

V. Performance Results
A. Performance Criteria and System Assumptions

We now present performance results of the layered
space-time concepts described so far. The performance measure
is the block-error rate (BLER) over Rayleigh fading.
The results are obtained through Monte Carlo simulation. The
BLER is averaged over up to 40 000 channel realizations. Each
block contains 400 information bits (before coding).

In comparing the performance results to channel capacity,
we follow the convention of a number of previous works (e.g.,
[9], [23]) to compare the computed BLER with the "outage
capacity" [1], i.e., the probability that a specified bit rate is not
supported by the channel capacity. This is a vague comparison,
since the Shannon limit refers to the highest error-free bit rate
possible for long encoded blocks but it does not specify how
long the blocks should be. Nevertheless, such a comparison
should still be meaningful as long as the block length and BLER
are specified. This is similar to the way a bit-error rate of 10~^
is commonly used as the "error free" reference for an additive
white Gaussian noise (AWGN) channel.

In order to assess the best performance achievable, we assume
that the channel characteristics can be perfectly estimated
at the receiver. Similarly, the choices of 1-D processing and
coding techniques are important to deliver the best possible
performance. We try to optimize these choices while keeping
them as practical as possible. Except for the use of array
processing and iterative MAP algorithms, all the radio link
techniques assumed in this study are "legacy-compatible" with
the EDGE standard (note also that vast research interest in turbo
coding has made simplified MAP algorithms available [21],
[22] that are not much more complex than the conventional
Viterbi algorithm). None of these techniques are claimed
to be optimum. Yet, our results indicate that near-capacity
performance is achievable when combining them through the
coded layered space-time architectures.

Fig. 3. Performance of bit-interleaved 8-PSK with rate-1/3 convolutional and
turbo coding over (a) AWGN channel, and (b) quasi-static flat Rayleigh fading
channels with N receive diversity antennas.

B. Choosing the Coding Scheme

We consider a bit-interleaved coded modulation scheme
using 8-PSK with Gray mapping and rate-1/3 coding.
Square-root Nyquist filtering with 30% rolloff is assumed at the
transmitter and receiver. Bit-interleaved coded modulation has
been shown [36], [37] to outperform traditional trellis-coded
modulation in fast fading channels (where time diversity can
be exploited through sufficient interleaving) and it can be
improved upon by considering a better mapping technique
that permits a large Euclidean distance without sacrificing the
maximum Hamming distance of the baseline coding scheme
[38]. In this paper, though, since quasi-static fading is our
basic assumption, the code by itself must be able to withstand
deep fades. In principle, any code that performs well in an
AWGN channel is considered a good candidate; turbo codes
are among the strongest candidates that come to mind.

Fig. 3(a) and (b) provide a performance comparison between
two rate-1/3 coding schemes: one using a 64-state convolutional
code with (octal) generators $(G_1, G_2, G_3) = (155, 117, 123)$
(the same code as proposed for EDGE [20]) and the other using
a turbo code with two identical 16-state recursive encoders
similar to the scheme originally proposed by Berrou and Glavieux
[12] (the results here assume generators $(G_1, G_2) = (23, 31)$,
which appeared to perform slightly better than other generators
we tested). Both schemes assume the use of bit-interleaved
8-PSK with Gray mapping. The turbo coding scheme has an
additional interleaver within the encoder, which uses another
pseudo-randomly generated permutation. The receiver structure
is consistent with what we have described so far. Note, however,
that we assume a minimum number of filter taps (only one
feedforward tap per branch and no feedback filter) whenever there
is no delay spread assumed, although the MAP processor in the
DDFSE is always used for iterative detection/decoding as
described in Section IV-B. For the turbo coding scheme, "one
iteration" means a full cycle of three processes: 1) MAP processing
in DDFSE; 2) turbo decoding by the first decoder; and 3) turbo
decoding by the second decoder.

Fig. 3(a) shows the performance of the two coding schemes in
an AWGN channel. First, we note that the performance of
convolutional coding also benefits from iterative processing. This
is due to the suboptimal nature of the decoding scheme, i.e.,
the "random modulation" effect described earlier, which can be
improved through iterative decoding. Fig. 3(a) shows that most
of the improvement is achieved within two decoding cycles. For
turbo coding, the performance still improves even after five
iterations, but saturates quickly after ten iterations. At 10"^ BLER
(approximately equivalent to 10*^ bit-error rate), turbo coding
outperforms convolutional coding by about 2.2 dB, and the
required SNR is within only 2.4 dB of the 0-dB Shannon limit for
a spectral efficiency of 1 bps/Hz (8-PSK with rate-1/3 coding).
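
The 0-dB figure quoted above follows from the AWGN capacity formula: a spectral efficiency of R bps/Hz requires an SNR of at least 2^R - 1. The sketch below evaluates this limit; the function name is an illustrative assumption.

```python
import math


def shannon_limit_db(spectral_efficiency):
    """Minimum SNR in dB for a given spectral efficiency on an AWGN channel.

    Obtained by inverting R = log2(1 + SNR).
    """
    return 10.0 * math.log10(2.0 ** spectral_efficiency - 1.0)


# 8-PSK carries 3 bits per symbol; with rate-1/3 coding that is 1 bps/Hz.
efficiency = 3 * (1.0 / 3.0)
print(shannon_limit_db(efficiency))
```

For 1 bps/Hz the limit is 0 dB, so a required SNR within 2.4 dB of the limit corresponds to roughly 2.4 dB in absolute terms.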

However, when we look at the average BLER performance
over quasi-static flat fading channels in Fig. 3(b), the benefit
of turbo coding (with ten iterations) over convolutional coding
(with two iterations) is reduced to only about 0.5 dB at any value
of the average SNR and for all the assumed numbers of receive
diversity antennas. This is not surprising for two reasons. First,
it is well known that the average BLER is determined mostly
by the probability of fading events that result in high BLERs.
If we look at the relative performance at a BLER of, say, above
10% in Fig. 3(a), the difference between the two coding schemes
is indeed less than 1 dB. Second, the performance of convolutional
coding over fading channels is already within about 2 dB
of the capacity bound; the capacity bound in this case is
defined as the probability that the combined output SNR of all
diversity branches is below the 0-dB Shannon limit. Thus, there
is not much room for further improvement.
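
The capacity bound defined above is an outage probability, and it has a simple closed form under two assumptions that the text does not spell out and that are made here only for illustration: the diversity branches fade independently with the same average SNR, and the combined output SNR is the sum of the branch SNRs (maximal-ratio combining). The combined SNR is then Erlang distributed, and the sketch evaluates its cumulative distribution at the Shannon limit. All names are hypothetical.

```python
import math


def outage_probability(avg_snr_db, branches, threshold_db=0.0):
    """P[combined SNR < threshold] for i.i.d. Rayleigh branches with MRC.

    avg_snr_db: average SNR per branch in dB (assumed equal on all branches).
    threshold_db: required SNR in dB (0 dB is the limit for 1 bps/Hz).
    """
    avg_snr = 10.0 ** (avg_snr_db / 10.0)
    x = 10.0 ** (threshold_db / 10.0) / avg_snr
    # Erlang CDF: 1 - exp(-x) * sum_{i < branches} x^i / i!
    tail = sum(x ** i / math.factorial(i) for i in range(branches))
    return 1.0 - math.exp(-x) * tail


for m in (1, 2, 4):
    print(m, outage_probability(10.0, m))
```

Under these assumptions the outage probability falls steeply as branches are added, which is the diversity behavior visible in the bound curves.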

Based on the fact that the performances of the two coding
schemes are quite similar in quasi-static fading channels, we
will only consider convolutional coding in the remainder of this
paper.

C. Layered Space-Time Performance 

We first look at the performance of the two coded layered
space-time approaches over a flat Rayleigh fading channel.
Fig. 4 shows the different capacity bounds for this channel,
assuming N = 2, 4, and 8, where N is the number of transmit
and receive antennas. Again, although we plot the results as
"block error rate," the capacity bound is defined as the probability
that the specified spectral efficiency R (R = N in this
case) is not supported by each of the differently defined channel
capacities. C denotes the Shannon capacity bound given by (4)
[which is equivalent to the generalized Foschini bound
in (11)]; the original Foschini bound (with the ZF
constraint) in (3) is also shown; and $C_{LST-II}$ is the capacity bound
for LST-II in (9) (however, the results for $C_{LST-II}$ are obtained simply
by averaging the probability that R = 1 is not supported by
each processing layer). Note that $C_{LST-II}$ can indeed provide
approximately a diversity order of N; this is attributed largely
to the use of layer ordering as discussed earlier. Note also that
all the bounds show an improvement with increasing N. This
means that the capacity actually increases more than linearly
with the number of transmit and receive antennas. However,
there is a diminishing improvement as N increases to a much
larger number.

Fig. 5. Layered space-time performance of LST-I with two transmit and two
receive antennas (N = 2). Quasi-static flat Rayleigh fading channel.

Fig. 5 shows the simulation results for LST-I with 2 transmit
and 2 receive antennas (i.e., N = 2). Three sets of results are
provided, assuming: 1) soft decisions; 2) hard decisions; and 3)
correct decisions in each layer (note that the DDFSE always uses
tentative decisions and provides soft outputs to the decoder).
We see that although soft decisions offer some improvement
over hard decisions, the impact of decision errors is still quite
noticeable. Fig. 6 shows similar results for N = 4. Here, the
impact of decision errors is not as significant as in the previous
results, and turbo processing and soft decisions help to reduce
much of this impact. With three iterations, the effect of decision
errors almost completely disappears when using soft decisions.
Decision errors have a lesser effect for a larger N because of the
greater diversity order available through array processing and
layer ordering.

In Fig. 7, we compare the results using soft decisions and six
"turbo" iterations with the Shannon capacity bound. For N = 4
and 8, the performance of LST-I is within 2.5-3 dB of the
capacity bound at 10% BLER (and about 3-3.5 dB at 1% BLER).
Since the BLER may vary as a function of the block size, it
is also important to consider the processing loss by discounting
the loss due to the inefficiency of modulation and coding. As
shown in Fig. 3(b), there is already a gap of about 2 dB between
the performance of our coding scheme and the Shannon limit.
Thus, the actual loss due to layered space-time processing is
only about 0.5-1 dB at 10% BLER.

As an example of the block-size dependence, when we double the
block size, the required average SNR is 0.2-0.4 dB greater than the
results shown here. However, this difference in average SNR applies
uniformly to all results, with or without layered space-time
processing.

Fig. 6. Layered space-time performance of LST-I with four transmit and four
receive antennas (N = 4).

Fig. 7. Layered space-time performance of LST-I for N = 2, 4, and 8. Soft
decisions. 6 iterations. Quasi-static flat Rayleigh fading channel.

Fig. 8. Layered space-time performance of LST-II for N = 2, 4, and 8. Soft
decisions. 2 iterations. Quasi-static flat Rayleigh fading channel.

Fig. 8 shows the performance of LST-II for N = 2, 4, and
8, compared to the Shannon and $C_{LST-II}$ bounds. Soft decisions
and two "turbo" iterations are assumed. (In this case, we
found the effect of decision errors to be marginal, i.e., the
results with hard and correct decisions were generally within 1
dB of the results shown here. Also, we found turbo processing
with more than two iterations to provide little improvement.)
At 10% BLER, the performance of LST-II is 2.5 dB from the
$C_{LST-II}$ bound, and 3.5 dB from the Shannon bound. At 1%
BLER, however, the loss compared to the Shannon bound can
be as much as 6 dB.

From the above results, we conclude that, for N = 4 and
8, LST-I outperforms LST-II by a margin of 0.5 dB (at 10%
BLER) to 3 dB (at 1% BLER). For N = 2, the performance of
LST-I is greatly affected by decision errors (note that, even in
this case, LST-I still performs as well as LST-II at 10% BLER),
whereas LST-II can reach a lower BLER at high average SNR.
Based on these results, the layered space-time approach is
not highly recommended for N = 2. As mentioned earlier,
space-time coding is a better alternative to achieve a spectral
efficiency of 2 bps/Hz. For instance, a 64-state space-time
coded QPSK can perform to within 2 dB of the Shannon
capacity bound [23].

D. Frequency-Selective Channels 

Finally, we present an example of performance results for
frequency-selective fading channels. This example assumes N = 4
and the use of soft decisions for both LST-I and LST-II. Fig. 9(a)
and (b) show the results for the TU and HT profiles, defined in
Tables I and II. Again, we only show results with two "turbo"
iterations for LST-II because little improvement can be achieved
with more iterations in this case. For both delay profiles, the
performance at 10% BLER is within 3 dB of the Shannon bound
for LST-I with six iterations, and within 4 dB for LST-II with
two iterations. At a lower BLER, the loss relative to the bound
is greater for HT than for TU. This is due to the limitation of
the suboptimum space-time equalizer structure we assume, as
already discussed in Section IV-A.

Fig. 9. Layered space-time performance of LST-I and LST-II over
frequency-selective channels: (a) TU profile, (b) HT profile. N = 4. Soft
decisions.

VI. Conclusion 

By deriving the generalized Shannon capacity formula and
suggesting a layered space-time architecture that attains a tight
lower bound on the capacity achievable, Foschini has laid a
significant theoretical foundation for improving the wireless
channel capacity through multiple-element array technology.
We have shown that Foschini's lower bound is actually the
true Shannon bound when the output SNR of the space-time
processing in each layer is represented by the corresponding
"matched filter" bound. We then provided two coded layered
space-time approaches as an embodiment of this concept. For
a large number of transmit and receive antennas, coding across
the layers provides a better performance than independent
coding within each layer. However, with two transmit and two
receive antennas, the former is heavily affected by decision
errors and, therefore, provides a poorer performance than the
latter.

The underlying coding and signal processing techniques used
in this study are based on practical but suboptimal approaches.
Yet, such suboptimality can be greatly compensated for by
iterative processing. Overall, our coded layered space-time
approaches can achieve a performance within about 3 dB of the
Shannon bound at 10% BLER, about 2 dB of which is a loss
due to the practical coding scheme we assume. Thus, not only
is the layered space-time architecture exactly what the Shannon
limit has prescribed in a theoretical sense, but it also provides
an attractive general methodology for improving and achieving
the wireless channel capacity.

APPENDIX 

Proving the Equivalence of Foschini Bound and
Shannon Capacity

Using the mathematical induction method, we will prove that
(4) and (11) are identical. In order to do so, we must show that

$$\det(\mathbf{R}\mathbf{K}^{-1}) = \prod_{k=1}^{N} (1 + \Gamma_k(f)) \qquad (19)$$

where $\mathbf{R}$ in the above equation is equivalent to $\mathbf{R}_N$ defined in
(13). Again, we assume that the kth layer has k - 1 interferences.
Note also that the proof provided here is independent of the
number of receive antennas M (the dimension of $\mathbf{R}$) and the
way the layers are ordered.

We start by assuming 1 signal source and M receive antennas.
It can be easily shown [1] that

$$\det(\mathbf{R}_1\mathbf{K}^{-1}) = 1 + \mathbf{H}_1^T\mathbf{K}^{-1}\mathbf{H}_1^* = 1 + \Gamma_1(f). \qquad (20)$$

Next, we assume that (19) is true for the case of n - 1 signal
sources, i.e.,

$$\det(\mathbf{R}_{n-1}\mathbf{K}^{-1}) = \prod_{k=1}^{n-1} (1 + \Gamma_k(f)). \qquad (21)$$

We then show in the following that, given (21), (19) is also true
for n signal sources.
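First, we note from (13) that

$$\mathbf{R}_n = \mathbf{R}_{n-1} + \mathbf{H}_n^*\mathbf{H}_n^T. \qquad (22)$$

Using the matrix inversion lemma [4, Appendix D], we can
show that [similar to (12)]

$$\mathbf{R}_n^{-1}\mathbf{H}_n^* = \frac{\mathbf{R}_{n-1}^{-1}\mathbf{H}_n^*}{1 + \Gamma_n(f)}. \qquad (23)$$

It follows that

$$\mathbf{K}\mathbf{R}_n^{-1}\mathbf{H}_n^* = \frac{\mathbf{K}\mathbf{R}_{n-1}^{-1}\mathbf{H}_n^*}{1 + \Gamma_n(f)}. \qquad (24)$$

For convenience, let

$$\mathbf{A} \triangleq \mathbf{R}_n\mathbf{K}^{-1} \quad \text{and} \quad \mathbf{B} \triangleq \mathbf{R}_{n-1}\mathbf{K}^{-1}. \qquad (25)$$

We can rewrite (24) as

$$\mathbf{A}^{-1}\mathbf{H}_n^* = \frac{\mathbf{B}^{-1}\mathbf{H}_n^*}{1 + \Gamma_n(f)}. \qquad (26)$$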



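Furthermore, using the matrix identity [40]

$$\mathbf{A}^{-1} = \frac{\mathrm{adj}(\mathbf{A})}{\det(\mathbf{A})} \qquad (27)$$

where $\mathrm{adj}(\mathbf{A})$ is called the adjugate matrix of matrix $\mathbf{A}$, we can
rewrite (26) as

$$\frac{\mathrm{adj}(\mathbf{A})\mathbf{H}_n^*}{\det(\mathbf{A})} = \frac{\mathrm{adj}(\mathbf{B})\mathbf{H}_n^*}{\det(\mathbf{B})\,(1 + \Gamma_n(f))}. \qquad (28)$$

By replacing $\det(\mathbf{B})$ in the above equation using (21), we obtain

$$\frac{\mathrm{adj}(\mathbf{A})\mathbf{H}_n^*}{\det(\mathbf{A})} = \frac{\mathrm{adj}(\mathbf{B})\mathbf{H}_n^*}{\prod_{k=1}^{n} (1 + \Gamma_k(f))}. \qquad (29)$$

Our goal is to prove that

$$\det(\mathbf{A}) = \prod_{k=1}^{n} (1 + \Gamma_k(f)). \qquad (30)$$

Thus, given (29), we must show that

$$\mathrm{adj}(\mathbf{A})\mathbf{H}_n^* = \mathrm{adj}(\mathbf{B})\mathbf{H}_n^*. \qquad (31)$$

From (22) and (25), we have

$$\mathbf{A} = \mathbf{B} + \mathbf{H}_n^*\mathbf{H}_n^T\mathbf{K}^{-1} = [\mathbf{b}_1 + \beta_1\mathbf{H}_n^*, \ldots, \mathbf{b}_M + \beta_M\mathbf{H}_n^*] \qquad (32)$$

where $\mathbf{b}_j$ is the jth column vector of $\mathbf{B}$ and $\beta_j$ is the jth element
of the row vector $\mathbf{H}_n^T\mathbf{K}^{-1}$, for $j = 1, \ldots, M$;
for convenience, we use M here to indicate the overall receive
diversity order, including the effects of both multiple antennas
and excess bandwidth.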

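We now prove (31) by showing that the jth element
of $\mathrm{adj}(\mathbf{A})\mathbf{H}_n^*$ is equal to the jth element of $\mathrm{adj}(\mathbf{B})\mathbf{H}_n^*$, for
$j = 1, \ldots, M$. Note that the jth element of $\mathrm{adj}(\mathbf{A})\mathbf{H}_n^*$ is given
by $\det(\mathbf{A}_j)$, where $\mathbf{A}_j$ is obtained by replacing the jth column
of $\mathbf{A}$ by $\mathbf{H}_n^*$. Similarly, the jth element of $\mathrm{adj}(\mathbf{B})\mathbf{H}_n^*$ is given by
$\det(\mathbf{B}_j)$, where $\mathbf{B}_j$ is obtained by replacing the jth column of
$\mathbf{B}$ by $\mathbf{H}_n^*$.

Using (32) and the linear properties of determinants, we can
show that

$$\det(\mathbf{A}_j) = \det[\mathbf{a}_1, \mathbf{a}_2, \ldots, \underbrace{\mathbf{H}_n^*}_{j\text{th column}}, \ldots, \mathbf{a}_M] \qquad (33)$$

$$= \det[\mathbf{b}_1, \mathbf{a}_2, \ldots, \underbrace{\mathbf{H}_n^*}_{j\text{th column}}, \ldots, \mathbf{a}_M] \qquad (34)$$

where $\mathbf{a}_i$ denotes the ith column of $\mathbf{A}$. The step from (33) to (34)
holds because $\mathbf{a}_1$ differs from $\mathbf{b}_1$ only by a scalar multiple of
$\mathbf{H}_n^*$, which already occupies the jth column and therefore
contributes nothing to the determinant.

Similarly, we can expand the result of (34) with respect to the
second column, the third column, and so on (except for the jth
column). Eventually, we obtain

$$\det(\mathbf{A}_j) = \det[\mathbf{b}_1, \mathbf{b}_2, \ldots, \underbrace{\mathbf{H}_n^*}_{j\text{th column}}, \ldots, \mathbf{b}_M] = \det(\mathbf{B}_j) \qquad (35)$$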

which proves (31). The proof of (19) is therefore complete.

According to the present invention, the receiver can select a set of
antenna elements, including their number and / or identity, from among a
larger group of antenna elements in order, among other things, to improve
performance of the system without increasing the extent of radiofrequency
circuitry. One process for selecting antenna elements is to utilize equation 4
above as a measure of quality for the particular set of antenna elements being
evaluated. That evaluation can occur for each permutation or combination of
antenna elements in order to select the subset with optimum performance (as
determined, for instance, by selecting the subset with greatest value
calculated according to equation 4). This can occur at whatever desired
points in time, including periodically.

CLAIMS

What is claimed is: 

1 . A radio receiver coupled to a plurality of receiver antenna elements, 
comprising a plurality of layers for processing signals received by the receiver 
elements, each layer comprising: 

a space-time equalizer, the space-time equalizer in a first layer of the 
plurality of layers coupled to each of the receiver elements, the space-time 
equalizer in each of the other layers of the plurality of layers coupled to an 
interference canceller which receives output from the layer preceding each 
said other layer. 

2. A receiver according to claim 1 further comprising a deinterleaver
coupled to each space time equalizer and to a decoder for output.

3. A radio receiver according to claim 1 in which each layer comprises its
own deinterleaver and decoder, and further comprises an interleaver adapted
to receive output from the decoder in said layer and from the deinterleaver in
said layer, the interleaver feeding output to the equalizer in said layer, thereby
allowing soft decisions about information being processed iteratively by said
layer to be fed back and forth between said equalizer and said decoder in said
layer.

4. A radio receiver according to claim 3 in which the interference canceller
which receives output from the layer preceding each said other layer receives
information from the decoder and the interference canceller in the preceding
layer.

5. A radio receiver according to claim 1 in which the equalizer in each
layer is connected to a common deinterleaver, which feeds a common
decoder, and in which a common interleaver receives signals from the
decoder, and is coupled to each equalizer in order to provide interleaved
signals to the equalizer.

6. A radio receiver according to claim 5 in which the interference canceller 
which receives output from the layer preceding each said other layer receives 
signals from the equalizer and from the interference canceller in said layer 
preceding each said other layer. 

7. A radio receiver according to claim 1 in which each space-time 
equalizer performs equalization using a minimum mean-square error criterion. 

8. A radio receiver according to claim 1 in which the receiver is adapted to
receive and process signals from a transmitter that is coupled to a plurality of
transmit antenna elements, the number of transmit antenna elements to which
the transmitter is coupled being less than the number of receiver antenna
elements to which the receiver is coupled.

9. A radio receiver according to claim 8 in which the receiver is adapted to
receive and process signals from a transmitter that is coupled to N transmit
antenna elements, the number of receiver antenna elements to which the
receiver is coupled is M, and M is greater than N.

10. A radio receiver according to claim 9 adapted to be coupled to at least 
one set of M receiver antenna elements out of K available receiver antenna 
elements. 

11. A radio receiver according to claim 1 in which the receiver is adapted to 
select the sequence in which information from the receiver antenna elements 
is to be processed. 

12. A communications system, comprising: 

a radio transmitter coupled to a stream of information and to a plurality 
of transmit antenna elements, said transmitter adapted to apportion a portion 
of the stream of information to each transmit antenna element by interleaving 
said portions of the information stream among the transmit antenna elements; 
and 

a radio receiver coupled to a plurality of receiver antenna elements, 
comprising a plurality of layers for processing signals received by the receiver 
elements, each layer comprising: 

a space-time equalizer, the space-time equalizer in a first layer of the 
plurality of layers coupled to each of the receiver elements, the space-time 
equalizer in each of the other layers of the plurality of layers coupled to an 
interference canceller which receives output from the layer preceding each 
said other layer. 

13. A system according to claim 12 further comprising a deinterleaver 
coupled to each space-time equalizer and to a decoder for output. 

14. A system according to claim 12 in which each layer comprises its own 
deinterleaver and decoder, and further comprises an interleaver adapted to 
receive output from the decoder in said layer and from the deinterleaver in 
said layer, the interleaver feeding output to the equalizer in said layer, thereby 
allowing soft decisions about information being processed iteratively by said 
layer to be fed back and forth between said equalizer and said decoder in said 
layer. 



15. A system according to claim 14 in which the interference canceller 
which receives output from the layer preceding each said other layer receives 
information from the decoder and the interference canceller in the preceding 
layer. 

16. A system according to claim 12 in which the equalizer in each layer is 
connected to a common deinterleaver, which feeds a common decoder, and 
in which a common interleaver receives signals from the deinterleaver and 
decoder, and is coupled to each equalizer in order to provide interleaved 
signals to the equalizer. 

17. A system according to claim 16 in which the interference canceller 
which receives output from the layer preceding each said other layer receives 
signals from the equalizer and from the interference canceller in said layer 
preceding each said other layer. 

18. A system according to claim 12 in which each space-time equalizer 
performs equalization using a minimum mean-square error criterion. 

19. A system according to claim 12 in which the number of transmit 
antenna elements to which the transmitter is coupled is less than the number 
of receiver antenna elements to which the receiver is coupled. 

20. A system according to claim 12 in which the transmitter is coupled to N 
transmit antenna elements, the receiver is coupled to M antenna elements, 
and M is greater than N. 

21. A system according to claim 20 in which the receiver is adapted to be 
coupled to at least one set of M receiver antenna elements out of K available 
receiver antenna elements. 

22. A system according to claim 12 in which the receiver is adapted to 
select the sequence in which information from the receiver antenna elements 
is to be processed. 

23. A communications system, comprising: 

a radio transmitter coupled to a stream of information and to a plurality 
of transmit antenna elements, said transmitter adapted to apportion a portion 
of the stream of information to each transmit antenna element by interleaving 
said portions of the information stream among the transmit antenna elements; 
and 

a radio receiver coupled to a plurality of receiver antenna elements, 
comprising a plurality of layers for processing signals received by the receiver 
elements, each layer comprising: 

a space-time equalizer, a deinterleaver and a decoder, the space-time 
equalizer in a first layer of the plurality of layers coupled to each of the 
receiver elements, the space-time equalizer in each of the other layers of the 
plurality of layers coupled to each of the receiver elements and to an 
interference canceller which receives output from the decoder in the layer 
preceding each said other layer, each space-time equalizer coupled to the 
deinterleaver in its layer, each deinterleaver coupled to the decoder in its 
layer; 

the equalizer in each layer adapted to perform minimum mean-square 
error equalization to signals being processed; and 

an output for said stream of information coupled to the decoders in 
each of said layers. 

24. A system according to claim 23 in which the transmitter is adapted to 
interleave portions of the information stream among the transmit antenna 
elements randomly. 

25. A system according to claim 23 in which the transmitter is adapted to 
interleave portions of the information stream among the transmit antenna 
elements pseudo-randomly. 

26. A system according to claim 23 in which the number of transmit 
antenna elements to which the transmitter is coupled is less than the number 
of receiver antenna elements to which the receiver is coupled. 

27. A system according to claim 26 in which the transmitter is coupled to N 
transmit antenna elements, the receiver is coupled to M antenna elements, 
and M is greater than N. 

28. A system according to claim 27 in which the receiver is adapted to be 
coupled to at least one set of M receiver antenna elements out of K available 
receiver antenna elements. 

29. A system according to claim 23 in which the receiver is adapted to 
select the sequence in which information from the receiver antenna elements 
is to be processed. 

30. A process for communicating an information stream using a radio 
transmitter and a radio receiver, including: 

a. coupling a radio transmitter to the information stream and to a 
plurality of transmit antenna elements, and interleaving portions of the 
information stream among the transmit antenna elements; 

b. transmitting said portions of the information stream; 

c. coupling a radio receiver to a plurality of receiver antenna 
elements to receive the transmitted information stream; and 

d. processing the information stream in a plurality of processing 
layers, comprising: 

(i) in a first layer, coupling the receiver antenna elements to 
an equalizer and space-time processing the signals from the receiver antenna 
elements in the equalizer; 

deinterleaving the output from the equalizer; 
decoding the deinterleaved output from the equalizer; 
feeding the decoded information to a common output for 
the information stream; and 

(ii) in each successive layer, coupling to an equalizer the 
output from an interference canceller that is fed by output from the preceding 
layer and space-time equalizing said output from said interference canceller; 

deinterleaving the output from the equalizer; 

decoding the deinterleaved output from the equalizer; 

and feeding the decoded information to a common output 
for the information stream. 

31. A process according to claim 30 in which steps of deinterleaving the 
output from the equalizer, decoding the deinterleaved output from the 
equalizer and feeding the decoded information to a common output for the 
information stream are performed inside each layer and further comprising, in 
each layer, reinterleaving deinterleaved and decoded output from said layer, 
and feeding the reinterleaved signal to the equalizer in said layer thereby 
allowing soft decisions about information being processed iteratively by said 
layer to be fed back and forth between said equalizer and said decoder in said 
layer. 

32. A process according to claim 30 in which the equalizer in each layer 
performs minimum mean-square error equalization. 

33. A process according to claim 30 in which said interleaving is performed 
randomly. 

34. A process according to claim 30 in which said interleaving is performed 
pseudo-randomly. 

35. A process according to claim 30 in which coupling of said receiver to 
said receiver antenna elements includes selecting a set of receiver antenna 
elements from a larger group of receiver antenna elements. 

36. A process according to claim 30 in which coupling of said receiver to 
said receiver antenna elements includes coupling to a set of receiver antenna 
elements selected from a larger group of receiver antenna elements. 

37. A process according to claim 36 in which said coupling further includes 
selecting the sequence in which signals from receiver antenna elements are 
to be processed. 

Fig. 2. Space-time DDFSE with MAP processing 

Fig. 3. Performance of bit-interleaved 8PSK with rate-1/3 convolutional 
Soft decisions. 2 iterations. Quasi-static flat Rayleigh fading channel. 